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Purification Result for Endo- 0 -N-acetylglucosaminidase (15-25% gradient SDS-PAGE) 



Lane 1: Purified endo- & -N-acetylglucosaminidase from Mucor hiemalis 
Lane 2: Molecular weight markers 
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Reaction enzyme rnap-for^p^L-EiidQ including the full-length 
novel endo- 0 -N-acetylglueosaminidase gene. 
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FIG. 3 

10 20 30 40 50 60 

GTCGACCCAC OCGTCCOCCX3 ACGCGTOGGC GGACGCGTGG GOGGACGCGT GGGTTTTATT 

70 80 90 100 110 120 

TTACATAAAT ATGCCTTCAC TTCAATTGCA ACCTGATGAC AAACTAGCAC CIU ' ITIX-TIT 

130 140 150 160 170 . 180 

TOCACTTAAG TCTATGAATG AGTTGAGGGA CTGGACGCCA GACGAAAAGA TAAAGTTTAA 

190 200 210 220 230 .240 

CGTTTCAAGC GTGGCACTAC AGCCTCGTCT GAAAAACGCC CTGAAACCTC AATTATTGTT 

250 260 270 280 290 300 

AACTCATGAT ATGGCAGGAG GATATAAAGA AGATAAAAAT ATTCAAGGAA ACAATTATAA 

310 320 330 340 350 360 

AGACATTTAT AACATTCAAT ATTGGCATTT AGCTGATACT TTTGTATATT TCTCTCATGA 

370 380 390 400 410 420 

GCGAGTTAQC ATTCCTCCAG TCAATTGGAC AAATGCTTGT CATAGAAATG GTGTAAAGTG 

430 440 450 460 470 480 

TTTAGCTACT TTTTTAGTAG AAGGAAATAA CCAAATGCAT GAAATGGAAG CCTTGCTTCA 

490 500 510 520 530 540 

COGTCCACCT TTACTTAATA ACACTGACGA CCCTATGAGA TTATGGAGTC CGTATTATGC 

550 560 570 580 590 600 

AGACCAATEA GTTGCTATTG CTAAACACTA TOGTTITGAT GGCTGGTTGT TCAATATTGA 

610 620 630 640 650 660 

ATGCGAATTC TTTCCTTTTC CTACAAATCC AAAATICAAA GCTGAAGAGT TGGCAAAGTT 

670 680 690 700 710 720 

TCTACACTAT TTTAAGGAAA AATTGCATAA CGAAATACCT GGATCTCAAC TCATTIGGTA 

730 740 750 760 770 780 

CGACAGCATG ACAAATGAAG GAGAAATCCA CTGGCAGAAC CAGCTCACAT GGAAAAATGA 

790 800 810 820 830 840 

GTTATTTTTr AAAAACACGG ATGGTATTTT TTTGAATTAT TGGTGGAAAA AAGAATACCC 

850 860 870 880 890 900 

TGAAATQGCG CGTAGACTAG CTGAAGGAAT AGGTAGATCA GGTTTAGAAG TTTATTTTGG 

910 920 930 940 950 960 

TACAGATC7EA TOGGGAAGGC ATACTTATGG TCGCGCyll aC /X' TTCAAATCAT ATAAGGGTGT 

970 980 990 1000 1010 . 1020 

AAAAACTGCC TACTCTGCAA TGACATCTTC TGCATTATTT GGTATGGCAT GGACATACGA 

1030 1040 1050 1060 1070 1080 

GCATTTCGAA AAGTCTGAAT TTGAAAAGAT GGATCGTTTG TlTlUJlVm GTGGTAAATA 

1090 H OP 1110 1120 1130 1140 

CTCTGACTAT CXnfcCCCCAC CTCCTAAAAA CCCAGATGAC GAAAAAGAAG TAGAAAGCGA 

1150 1160 1170 1X80 1190 1200 

TGATAGTGAA GATGAGCTCA TGTACGGACA CAAGAAAGGT ATTGCTGACA CGGTAGAATC 

1310 1220 1230 1240 1250 1260 

TATTCCTCTA CCAGGAACAG ATTQGTTTGT TACCAATITT GATAGGGGCT TTGGAAATAG 

1270 1280 129 0 1300 1310 1320 

GTTTDVTTAT AGAGGAAAGA GATTACTTIC TCAGCCTTOG TCCCATTTAT CGCATCAAGC 

1330 1340 1350 1360 1370 1380 

TATTCTCCCC AATAAAAGCT ATCGAAATCC AGAGATTTAT CCCACTGATC AAAACATTAA 



Entire nucleotide sequence of the fragment inserted into the Sal I-Not I sites of pZL-Endo 
including the full-length novel endo- 0 -N-acetylglucosaminidase gene. 
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FIG. 4 




1390 1400 1410 1420 ldin mm 

AATCACTAGT TCTCTCGATT GCGATCATGG AGCmTCTT GGTGGAACCT CGCTTATTAT 

1450 1460 1470 1480 1490 iqnn 

CAAAGGCCAA CGTTICAATC ATAGAGAATC GCATGATCTT GAAACTGAAA TTAGTATACC 

1510 1520 1530 1540 1550 1S fin 

TCTGTATAAG CTITCATTAG ATGCTAGTAA AGGATGCTCA TTGCGTTATA TTTATAGAAG 

I 570 1580 1590 1600 1610 1 fi9f> 

TITGnTGATG AAAGATGTAA AGTTCACAGT AGCATGTCAC TTTTCGTTAA AAACAAACGA 

1630 1640 1650 1660 1670 16SO 

CTCAGTTAAT TTCTTCAAGG TATGGCAGCC AGATGAAAAT TTCTCTTITG AATATGATGA 

„ 1690 1700 1710 1720 1730 1740 

TOGAATGAGA GCCACITTTTA CAACIGAAAA TTCTACCGAA AGCAGATGCT TTTTATTACG 

1750 1760 1770 1780 1790 mnn 

TACAACAGAA GAAGATACAG GAGAAAATGA 1TGGATAACA AAAACTATTA ATGTGCCTGC 

18 1° 1820 1830 1840 1850 iflfin 

TGTICCAGAA GGAAGTCAAT TATACATTAC AAGACTTGAA GTGAGCGTAG TATTAGATAC 

1870 1SS0 1890 1900 1910 1920 

AGCTGGTTEA GTAGGTCTTG TTAATCAAGT TATTGCTTGC TTOGGATATA TTAGCATCAT 

1530 1940 1950 I960 1970 iqao 

ACCAACTATA AATTCTQGAA TAAAAACAGA TTCATCACGC ATTATTCAGG ATCTCTTTTG 

1990 2000 2010 2020 2030 2040 

GAAAGATCAG AAATATACCA AAATCGGAAA AGAAAGTTTA GACGACATAG CTCAAGAAGA 

2 °50 2060 2070 2080 2090 2100 

ACTTCATAGA TATTATGGAA CATTGAACTC GGAAAACACA GCAAATGTAG TAAACGCTIG 

2110 2120 2130 2140 2150 2160 

GGAGGAAAXA GATTACTACA AOGTTITITA CAAAGAAAGT GACGACTCTG CAACTCGCAT 

" -7° 2180 2190 2200 22 io 2220 

CirriTAGGA ACAGCATTCT ctaatcaatt tcgtgtatct gctttagata ttaitttatc 

2230 2240 2250 2260 2270 2280 

TAAGCTACCA AAGATAGTCA TTGAAGCTGT TAACAAAGAA GGATACATCT CTTCAAGTX3G 

2290 2 300 2310 2320 2330 2340 

TAGCATAGAT TTGTCATTAA ACTAGGACTT GAAATAAAAT ATTATGATAA AGAAAAAAAA 



2 350 2360 2370 

AAAAAAAAAA AAAAAAAAAG GGCGGCCGC. 



2380 



2390 



2400 



Entire nucleotide sequence of the fragment inserted into the Sal I-Not I sites of pZL-Endo 
including the full-length novel endo- /3 -N-acetylglucosaminidase gene. (Continued) 
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FIG. 5 




9 18 27 36 45 54 

ATC OCT TCA CTT CAA TTG CAA OCT GAT GAC AAA CTA GCA CCT GTT TCT TIT GCA 



M 



63 72 81 90 99 108 

CTT AAG TCT ATC AAT GAG TTG AGG GAC TOG ACG CCA GAC GAA AAG ATA AAG TTT 



M N E L R D W 



F 



117 126 135 144 153 162 

AAC GTT TCA ADC GIG GCA CTA CAG CCT CGT GTG AAA AAC GCC CTC AAA CCT CAA 

NV S SVALQ P R V K N A L K P~ Q~ 

171 180 189 198 207 216 

TTA TTG TTA ACT CAT GAT ATG GCA GGA GGA TAT AAA GAA GAT AAA AAT ATT CAA 

LLLTHD MAGGYKEDKNIQ 

225 234 243 252 261 270 

GGA AAC AAT TAT AAA GAC ATT TAT AAC ATT CAA TAT TOG CAT TEA GCT GAT ACT 

GN N Y K D I Y N I Q YWH LA D~T~ 

279 288 297 306 315 324 

TTT GTA TAT TTC TCT CAT GAG CGA GTT AGC ATT CCT CCA GTC AAT TOG ACA AAT 

FVYFSHERV SIPPVNWTN 

333 342 351 360 369 378 

GCT TCT CAT AGA AAT QGT GTA AAG TCT TTA GGT ACT TIT TEA GTA GAA GGA" AAT 



N 



N 



387 396\ 405 414 423 432 

AAC CAA ATC CAT GAA ATG GAA GCC TTG CTT CAC GGT CCA CCT TTA CTT AAT AAC 



N Q M H E M 



N N 



441 450 459 468 477 486 

ACT GAC GAC CCT ATC AGA TTA TOG ACT COG TAT TAT GCA GAC CAA TTA GTT GCT 

TDDPMRLWSPYYADQLVA 

495 504 513 522 531 540 

ATT GCT AAA CAC TAT GGT TTT GAT GGC TOG TTG TTC AAT ATT GAA TOC GAA TTC 

IAKHY. GFDGWLFNIECEF 

549 558 567 576 585 594 

TTT CCT TTT CCT ACA AAT CCA AAA TTC AAA GCT GAA GAG TTC GCA AAG TTT CTA 

FPFPTNPKFKAEELAKFL 

603 612 621 630 639 648 

CAC TAT TTT AAG GAA AAA TTC CAT AAC GAA ATA CCT GGA TCT CAA CTC ATT TOG 



HYFKE KLHN 



W 



657 666 675 684 693 702 

TAG GAC AGC ATC ACA AAT GAA GGA GAA ATC CAC TOG CAG AAC CAG CTC ACA TOG 

YDSM TNEGElHWQNQL~T~"w~ 



Amino acid sequence deduced from the novel Endo- /? -N-acetylglucosaminidase gene, and 
the nucleotide sequence of the DNA encoding this amino acid sequence. 
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711 720 729 738 747 756 

AAA AAT GAG TTA TIT TIT AAA AAC ACG GAT GGT ATT TIT TIG AAT TAT TOG TOG 

KNE LFFKNTDG IFLNYWW 

765 774 783 792 801 810 

AAA AAA GAA TAG CCT GAA ATG GOG CGT AGA GTA GCT GAA GGA ATA GGT AGA TCA 

KKEYPEMARRVAEGIGRS 

819 828 837 846 855 864 

GGT TEA GAA GTT TAT . TIT GGT ACA GAT GTA TGG GGA AGG CAT ACT TAT GGT GGC 



GLEVYFGT D VWGRHTYGG 

873 882 891 900 909 918 

GGT GGT TIC AAA TCA TAT AAG GGT GTA AAA ACT GCC TAC TCT GCA ATG ACA TCT 
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GCA TTA 
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GGT ATG 


GCA 
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TGG ACA 


TAC 
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GAG CAT 


TIC 
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GAA AAG 


TCT 
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GAA TIT 
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A L 


F 


G M 
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W T 
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E H 
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E K 
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E F 


GAA 


981 
AAG ATG 


GAT 


990 
OGT TIG 




999 
TGG TGT 


GGT 
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TAC 
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TCT GAC 


TAT 


1026 
CCT CCC 
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K M 
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R 1* 
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W C 


G 


G K 


Y 


S D 


Y 


P P 


CCA 


1035 
OCX* CCT 


AAA 


1044 
AAC CCA 


GAT 


1053 
GAC GAA 


AAA 
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GAA GTA 


GAA 


1071 
AGC GAT 


GAT 


1080 
AGT GAA 


P 


P P 


K 


N P 


D 


D E 


K 


E V 


E 


5 D 


D 


S E 


GAT 


1089 
GAG CMC 


ATG 


1098 
TAC GGA 


CAC 


1107 
AAG AAA 


GGT 


1116 
ATT GCT 


GAC 


1125 
ACG GTA 


GAA 


1134 
TCT ATT 


D 


E L 


H 


Y G 


H 


K K 


G 


I A 


D 


T V 


E 


S I 


CCT 


1143 
GTA CCA 


GGA 


1152 
ACA GAT 


TOG 


1161 
TIT GTT 


ACC 


1170 
AAT TIT 


GAT 


1179 
AGG GGG 


TTT 


1188 
GGA AAT 
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V P 


G 


T D 
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F V 
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N F 
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R G 


F 


G N 


AGG 


1197 
TIT TAT 


TAT 


1206 
AGA GGA 


AAG 


1215 
AGA TTA 


CTT 


1224 
TCT GAG 


CCT 


1233 
TGG TCC 


CAT 


1242 
TTA TOG 


R 


F Y 


Y 


R G 


K 


R L 


It 


S Q 


P 


W S 


H 


L S 


CAT 


1251 
CAA GCT 


ATT 


1260 
CTC CCC 


AAT 


1269 
AAA AGC 


TAT 


1278 
CGA AAT 


CCA 


1287 
GAG ATT 


TAT 


1296 
CCC ACT 


H 


Q A 


I 


L P 


N 


K S 


Y 


R N 


P 


E I 


Y 


P T 


GAT 


1305 
CAA AAC 


ATT 


1314 
AAA ATC 


ACT 


1323 
AGT TCT 


CTC 


1332 
GAT TGC 


GAT 


1341 
CAT GGA 


GCT 


1350 
TTT CTT 


D 


Q N 


I* 


K I 


T 


S S 


L 


D C 


D 


H G 


A 


F L 


GGT 


1359 
GGA ACC 


TOG 


1368 

err att 


ATC 


1377 
AAA GGC 


CAA 


1386 
CGT TTC 


AAT 


1395 
CAT AGA 


GAA 


1404 
TCG CAT 


G 


G T 


S 


L I 


I 


K G 


Q 


R F 


N 


H R 


E 


S H 


GAT 


1413 
GTT GAA 


ACT 


1422 
GAA ATT 


AGT 


1431 
ATA CCT 


CTG 


1440 
TAT AAG 


CTT 


1449 
TCA TTA 


GAT 


1458 
GCT ACT 
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Amino acid sequence deduced from the novel Endo- j3 -N-acetylglucosaminidase gene, and 
the nucleotide sequence of the DNA encoding this amino acid sequence. (Continued) 
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FIG. 7 



1467 1476 1485 1494 1503 1512 

AAA QGA TOC TCA TTG CGT TAT ATT TAT AGA ACT TTG TTG ATG AAA GAT GTA AAG 
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V K 


TTG 


1521 
ACA GTA 


CCA 


1530 
TGT CAC 


TTT 


1539 
TOG TTA 


AAA 


1548 
ACA AAC 


GAC 


1557 
TCA GTT 


AAT 


1566 
TTC TTC 


L 


T V 
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C H 
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S L 
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T N 
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S V 


N 


F F 


AAG 


1575 
GTA TOG 


GAG 


1584 
CCA GAT 


GAA 


1593 
AAT TTC 


TCT 


1602 
TTT GAA 


TAT 


1611 
GAT GAT 


GGA 


1620 
ATG AGA 
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V H 
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P D 


E 


N F 
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F E 
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D D 
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M R 


GCC 


1629 
ACT GTT 


ACA 


1638 
ACT GAA 


AAT 


1647 
TCT ACC 


GAA 


IDjO 

AGC AGA 


TGC 


TTT TTA 


TTA 


1674 
CGT ACA 


A 


T V 


T 


T E 


N 


S T 


E 


S R 


C 


F L 


L 


R T 


ACA 


1683 
GAA GAA 


GAT 


1692 
ACA GGA 


GAA 


1701 
AAT GAT 


TGG 


1710 
ATA ACA 


AAA 


1719 
ACT ATT 


AAT 


1728 
GTG OCT 



1737 1746 1755 1764 1773 1782 

OCT GTT CCA GAA GGA ACT CAA TTA TAG ATT ACA AGA CTT GAA GTG AGC GTA GTA 



1791 1800 1809 1818 1827 1836 

TTA GAT ACA OCT GGT TTA GTA GGT CTT GTT AAT CAA GTT ATT OCT TGC TTG GGA 



1845 1854 1863 1872 1881 1890 

TAT ATT AGC ATC ATA CCA ACT ATA AAT TCT GGA ATA . AAA ACA GAT TCA TCA CGC 



1899 1908 1917 1926 1935 1944 

ATT ATT CAG GAT CTC TTT TGG AAA GAT CAG AAA TAT ACC AAA ATC GGA AAA GAA 



1953 1962 1971 1980 1989 1998 

AGT TTA GAC GAC ATA OCT CAA GAA GAA GTT CAT AGA TAT TAT GGA ACA TTG AAC 



2007 2016 2025 2034 2043 2052 

TGG GAA AAC ACA CCA AAT GTA GTA AAC GCT TGG GAG GAA ATA GAT TAC TAG AAC 
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TTC 


2115 
TGT AAT 


CAA 


2124 
TTT CGT 


GTA 


2133 
TCT GGT 


TTA 


2142 
GAT ATT 


ATT 


2151 
TTA TCT 


AAG 


2160 
CTA CCA 
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C N 
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F R 


V 


S G 


L 


D I 


I 


L S 


K 


L P 


AAG 


2169 
ATA GTT 


ATT 


2178 
GAA GCT 


GTT 


2187 
AAC AAA 


GAA 


2196 
GGA TAC 


ATC 


2205 
TCT TCA 


AGT 


2214 
GGT AGC 
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I V 
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B A 


V 


N K 


E 


G Y 
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s s 
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G S 


ATA 
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GAT TTG 


TCA 
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TTA AAC 
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Amino acid sequence deduced from the novel Endo- 0 -N-aeetylglueosaminidase gene, and 
the nucleotide sequence of the DNA encoding this amino acid sequence. (Continued) 
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FIG. 8 

Novel endo* j3 -N-acetylglucosaminidase gene 



Not I EcoT22I EcoRI Sac I Bell Hind III Pvu II Not I 




2.5 kbp 



Structure of expression vector pGEndo-SC for the use in Saccharomyces cerevisiae, which 
comprises a novel endo- j3 -N-acetylglucosaminidase gene. 
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FIG. 9 
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Expression of endo- 0 -N-acetylglucosaminidase enzyme in yeast into which 
an endo- 0 -N-acetylglucosaminidase gene has been introduced. 

Lanes 1-3: Cellular extract of 5. cerevisiae YPH500 (pep4) into which 

an endo- 0 -N-acetylglucosaminidase gene has been introduced. 
Lane 4: Purified endo- 0 -N-acetylglucosaminidase derived from M. hiemalis 
Lane 5: Cellular extract of S. cerevisiae YPH500 (pep4) 
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